NSF PAR Search | NSF Public Access Repository

Spurious Rewards: Rethinking Training Signals in RLVR

Shao, Rulin; Li, Shuyue Stella; Xin, Rui; Geng, Scott; Wang, Yiping; Oh, Sewoong; Du, Simon Shaolei; Lambert, Nathan; Min, Sewon; Krishna, Ranjay; et al (June 2025, cs.AI)

Full Text Available
Trusting Your Evidence: Hallucinate Less with Context-aware Decoding

Shi, Weijia; Han, Xiaochuang; Lewis, Mike; Tsvetkov, Yulia; Zettlemoyer, Luke; Yih, Wen-tau (June 2024, NAACL)

Language models (LMs) often struggle to pay enough attention to the input context, and generate texts that are unfaithful or contain hallucinations. To mitigate this issue, we present context-aware decoding (CAD), which follows a contrastive output distribution that amplifies the difference between the output probabilities when a model is used with and without context. Our experiments show that CAD, without additional training, significantly improves the faithfulness of different LM families, including OPT, GPT, LLaMA, and FLAN-T5 for summarization tasks (e.g., 14.3% gain for LLaMA in factuality metrics). Furthermore, CAD is particularly effective in overriding a model's prior knowledge when it contradicts the provided context, leading to substantial improvements in tasks where resolving the knowledge conflict is essential.
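The contrastive distribution described in this abstract can be illustrated with a minimal sketch. The abstract does not give the formula, so the weighting used here, (1 + alpha) times the with-context logits minus alpha times the without-context logits, is an assumption, as are the function names, the value of alpha, and the toy three-token vocabulary.

```python
import math


def softmax(logits):
    """Convert a list of logits into probabilities (numerically stable)."""
    peak = max(logits)
    exps = [math.exp(v - peak) for v in logits]
    total = sum(exps)
    return [e / total for e in exps]


def contrastive_logits(with_context, without_context, alpha=1.0):
    """Hypothetical helper: boost tokens the context makes more likely.

    Assumed form: (1 + alpha) * logit(y | context, x) - alpha * logit(y | x).
    With alpha = 0 this reduces to ordinary decoding with context.
    """
    return [
        (1.0 + alpha) * wc - alpha * nc
        for wc, nc in zip(with_context, without_context)
    ]


# Toy next-token logits over a 3-token vocabulary (illustrative numbers only).
logits_with_context = [2.0, 1.0, 0.0]     # the context favours token 0
logits_without_context = [0.5, 2.0, 0.0]  # the model's prior favours token 1

baseline_probs = softmax(logits_with_context)
cad_probs = softmax(
    contrastive_logits(logits_with_context, logits_without_context, alpha=1.0)
)
```

In this toy case the token supported by the context gains probability mass relative to ordinary decoding, while the token favoured only by the model's prior loses it.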
Full Text Available
Do Membership Inference Attacks Work on Large Language Models?

Duan, Michael; Suri, Anshuman; Mireshghallah, Niloofar; Min, Sewon; Shi, Weijia; Zettlemoyer, Luke; Tsvetkov, Yulia; Choi, Yejin; Evans, David; Hajishirzi, Hannaneh (October 2024, COLM)

Full Text Available
SILO Language Models: Isolating Legal Risk In a Nonparametric Datastore

Min, Sewon; Gururangan, Suchin; Wallace, Eric; Shi, Weijia; Hajishirzi, Hannaneh; Smith, Noah; Zettlemoyer, Luke (May 2024, ICLR)

Full Text Available
CopyBench: Measuring Literal and Non-Literal Reproduction of Copyright-Protected Text in Language Model Generation

https://doi.org/10.18653/v1/2024.emnlp-main.844

Chen, Tong; Asai, Akari; Mireshghallah, Niloofar; Min, Sewon; Grimmelmann, James; Choi, Yejin; Hajishirzi, Hannaneh; Zettlemoyer, Luke; Koh, Pang Wei (January 2024, Association for Computational Linguistics)

Full Text Available
CREPE: Open-Domain Question Answering with False Presuppositions

https://doi.org/10.18653/v1/2023.acl-long.583

Yu, Xinyan; Min, Sewon; Zettlemoyer, Luke; Hajishirzi, Hannaneh (January 2023, ACL)

Full Text Available
Z-ICL: Zero-Shot In-Context Learning with Pseudo-Demonstrations

https://doi.org/10.18653/v1/2023.acl-long.129

Lyu, Xinxi; Min, Sewon; Beltagy, Iz; Zettlemoyer, Luke; Hajishirzi, Hannaneh (January 2023, ACL)

Full Text Available
Toward Human Readable Prompt Tuning: Kubrick’s The Shining is a good movie, and a good prompt too?

https://doi.org/10.18653/v1/2023.findings-emnlp.733

Shi, Weijia; Han, Xiaochuang; Gonen, Hila; Holtzman, Ari; Tsvetkov, Yulia; Zettlemoyer, Luke (January 2023, Association for Computational Linguistics)

Large language models can perform downstream tasks in a zero-shot fashion, given natural language prompts that specify the desired behavior. Such prompts are typically hand engineered, but can also be learned with gradient-based methods from labeled data. However, it is underexplored what factors make the prompts effective, especially when the prompts are in natural language. In this paper, we investigate common attributes shared by effective prompts in classification problems. We first propose a human readable prompt tuning method (FluentPrompt) based on Langevin dynamics that incorporates a fluency constraint to find a distribution of effective and fluent prompts. Our analysis reveals that effective prompts are topically related to the task domain and calibrate the prior probability of output labels. Based on these findings, we also propose a method for generating prompts using only unlabeled data, outperforming strong baselines by an average of 7.0% accuracy across three tasks.
Full Text Available
Towards Understanding Chain-of-Thought Prompting: An Empirical Study of What Matters

https://doi.org/10.18653/v1/2023.acl-long.153

Wang, Boshi; Min, Sewon; Deng, Xiang; Shen, Jiaming; Wu, You; Zettlemoyer, Luke; Sun, Huan (January 2023, Association for Computational Linguistics)

Full Text Available
Nonparametric Masked Language Modeling

https://doi.org/10.18653/v1/2023.findings-acl.132

Min, Sewon; Shi, Weijia; Lewis, Mike; Chen, Xilun; Yih, Wen-tau; Hajishirzi, Hannaneh; Zettlemoyer, Luke (January 2023, ACL Findings)

Full Text Available
